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ABSTRACT 

Trypanosomatids, such as the sleeping sickness 
parasite Trypanosoma brucei, contain a ~20S 
RNA-editing complex, also called the editosome, 
which is required for U-insertion/deletion editing of 
mitochondrial mRNAs. The editosome contains a 
core of 12 proteins including the large interaction 
protein A1, the small interaction protein A6, and 
the editing RNA ligase L2. Using biochemical and 
structural data, we identified distinct domains of 
T. brucei A1 which specifically recognize A6 and 
L2. We provide evidence that an N-terminal 
domain of A1 interacts with the C-terminal domain 
of L2. The C-terminal domain of A1 appears to be 
required for the interaction with A6 and also plays a 
key role in RNA binding by the RNA-editing ligase L2 
in trans. Three crystal structures of the C-terminal 
domain of A1 have been elucidated, each in complex 
with a nanobody as a crystallization chaperone. 
These structures permitted the identification of 
putative dsRNA recognition sites. Mutational 
analysis of conserved residues of the C-terminal 
domain identified Arg703, Arg731 and Arg734 as 
key requirements for RNA binding. The data show 
that the editing RNA ligase activity is modulated by a 
novel mechanism, i.e. by the frans-acting RNA 
binding C-terminal domain of A1. 

INTRODUCTION 

Trypanosomatids are the causative agent of human tryp- 
anosomiasis, Chagas disease and leishmaniasis which are 
important diseases in tropical and subtropical regions of 



the world. Trypanosomatids undergo remarkable changes 
in their morphology and energy generation during their 
insect and mammalian bloodstream life cycles that is 
coupled to mitochondrial gene expression (1-3). The mat- 
uration of most mitochondrial genes in trypanosomes 
requires major RNA-editing steps (4-6). In trypano- 
somatids, RNA editing is a post-transcriptional modifica- 
tion which alters the mitochondrial mRNA transcripts by 
insertion or deletion of uridylates (U insertion/deletion 
RNA editing). This process is mediated by small RNAs 
known as guide RNAs (gRNAs) (7-9). RNA-editing steps 
are catalyzed by several multi-enzyme complexes, a major 
one of which is called the '~20S editosome and also 
editosome'. In Trypanosoma brucei, 20 proteins have been 
found to be associated with the editosome. U-insertion/ 
deletion RNA editing is a very complicated process which 
involves a cascade of enzymatic steps (10-12). 

There are three distinct types of ~20S editosomes 
(13-15), which share a common core of 12 proteins 
including four enzymes (i.e. the RNA-editing ligase 
1 (LI), RNA-editing ligase 2 (L2), the U-specific 3'- 
exouridylylase X2 and the 3'-terminal uridylyl-transferase 
T2) and six 'interaction proteins': Al, A2, A3, A4, A5 and 
A6. (For editosome protein nomenclature, see 
e.g. (11,16,17).) Many of these enzymes and interaction 
proteins have been shown to be essential for the 
function of the editosome (18-26). The six interaction 
proteins vary greatly in length, but each contains a pre- 
dicted oligonucleotide-binding (OB)-fold near the 
C-terminus (27). Comprehensive interaction mapping 
and mass spectrometry studies revealed that the enzymes 
are networked by specific associations with interaction 
proteins (14,17). Four editosome interaction proteins 
(A1-A4) have been shown to interact with A6 suggesting 
the formation of 'a five OB-fold center' in the core of the 
editosome that functions as a scaffold for the entire 
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editosome (15,17,28,29). The largest interaction 
protein Al contains two zinc-finger motifs followed by a 
C-terminal OB-fold (27). The interactions between 
full length Al and other editosome proteins, including 
A6 and L2, have been reported before (17). However, 
which domains of Al are responsible for interacting 
with these two other editosome proteins was unknown 
so far. 

The two editing RNA ligases LI and L2 share a high 
degree of amino acid sequence identity (30,31) and belong 
to the nucleotidyltransferase (NTase) superfamily, which 
also includes ATP and NAD-dependent DNA ligases as 
well as eukaryotic mRNA capping enzymes (30). The 
ligases catalyze the sealing of the 3'-hydroxyl group and 
the 5'-phosphate group of DNA or RNA termini via a 
series of nucleotidyl transfer reactions (32-34). All DNA 
ligases and mRNA capping enzymes contain a common 
catalytic core structure composed of a NTase domain and 
an OB-fold domain (32,35-37). 

DNA (or RNA) substrate recognition by ligases is a key 
step of the ligation reaction. It has been shown that the 
ligase remains bound to DNA (or RNA) in most of the 
steps of the reaction (35,38-41). Based on several DNA 
ligase crystal structures, the OB-fold domain of the ligase 
is required for the initial auto-adenylation step 
(33,37,42,43). This domain plays a major role in DNA 
binding and is implicated in specific substrate recognition 
(35,38,39,44,45). Interestingly, unlike most DNA ligases 
and mRNA capping enzymes, the RNA ligase family, 
including T4 RNA ligase 1 (Rnll), T4 RNA ligase 
2 (Rnl2) and RNA-editing ligases LI and L2, lacks an 
OB-fold domain (30,40,46). 

The RNA-editing ligases LI and L2 are distinguished 
from other DNA or RNA ligases in that they are 
embedded in the editosome, where interaction proteins 
improve RNA ligation efficiency by an unknown mechan- 
ism (15,20). It has been reported that full length Al and 
A2 stimulate the auto-adenylation and ligation activity of 
L2 and LI, respectively (15,47). However, which domains 
of interaction proteins interact with RNA-editing ligases, 
and how interaction proteins Al and A2 stimulate 
the activity of RNA-editing ligases remained unknown 
so far. 

Through a combination of biochemical and structural 
studies, we have identified the functions of two domains 
of the interaction protein Al. An N-terminal domain 
of Al appears to interact with the C-terminal 
region of L2. The C-terminal OB domain of Al (Al OB ) 
links Al to A6 and therefore to the OB-fold center in the 
core of the editosome (29). Importantly, the 
C-terminal OB domain of Al stimulates nicked dsRNA 
substrate binding by L2 in vitro. It appears that the 
A1«L2 complex has two RNA-binding modules 
participating in dsRNA recognition: the NTase domain 
of L2 and the OB-fold domain of Al, each of which 
alone fails to bind tightly to nicked dsRNA. We discovered 
that three basic residues of the Al OB are essential for RNA 
binding by the A1«L2 complex. It is therefore most likely 
that, also in the editosome, the C-terminal domain of Al 
assists RNA-binding by the editing RNA ligase L2 in trans. 



MATERIALS AND METHODS 

Cloning of T. bvucei Al OB .A6 and T. hrucei A1.L2 

The binary complex of T. brucei editosome proteins Al 
and A6 was produced using a co-expression strategy as 
described previously (17). The gene for T. brucei A6 
(residues 20-164) and the gene encoding residues 626- 
762 of T. brucei Al (Al OB ) were cloned into the 
bi-cistronic expression vector pRSF. A construct for ex- 
pressing a variant of Al OB was made with the 38-residue 
loop L23, comprising residues 658-695, replaced by a 
linker GASG. The protein expressed by this construct is 
hereafter called A1° BA . 

For the expression and purification of wild-type 
T. brucei A1»L2 biliary complexes, the gene for 
T. brucei L2 (residues 21-416) was cloned into pACYC 
without any tag (hereafter called full length L2). The gene 
for T. brucei Al (residues 196-762) was cloned into 
pSKB3 for expression with an N-terminal His 6 -tag 
followed by a tobacco etch virus (TEV) protease 
cleavage site. Full length Al A658-695 (hereafter called 
A1 A ) construct with 38 residues in loop L23 replaced by 
GASG was subcloned into a pACYC vector. Full length 
Al variants with glutamate substitution in R703, K715, 
K719, R731, R734, K741 and R742 were introduced by 
the quick change site-directed mutagenesis method. Wild- 
type, truncations, L23 deletion and point mutants of Al 
and L2 were prepared as summarized in the 
Supplementary Methods. 

Crystallization of the T. brucei Al OBA » A1 Nbl0 complex 

Anti-Al nanobodies ( A1 Nbs) were generated and used as 
crystallization chaperones. Crystals were grown via the 
sitting-drop vapor diffusion method by mixing a Al OBA 
•A6« A1 Nbl0 protein solution with an equal volume of 
reservoir solution. Crystals of Al OBA with A1 Nbl0 protein 
were grown in three different conditions. Form I was 
obtained from 0.05 M ammonium sulfate, 0.05 M 
sodium acetate trihydrate pH 5.0, 30% w/v polyethylene 
glycol (PEG) 2000 MME, 0.2 M sodium thiocyanate, 
belonged to space group V2{1{1 and diffracted to 2.0 A; 
Form II from 0.2 M magnesium chloride, 0.1 M bis-tris 
pH 5.5, 25% w/v PEG 3350, belonged to space group C2 
and diffracted to 2.65 A; and, Form III from 0.05 M 
ammonium sulfate, 0.05 M sodium acetate trihydrate pH 
5.0, 30% w/v PEG 2000 MME, 5% v/v jeffamine M600, 
belonged to space group C222 and diffracted to 2.7 A. 
Selenomethionyl derivative protein crystals in Form I 
were also obtained. Detailed information on mutagenesis, 
crystallization, structure determination and refinement, 
production of anti-Al OB nanobodies and RNA-binding 
assays is included in the Supplementary Methods. 
Crystallographic data collection and refinement statistics 
are shown in Supplementary Table SI. 

RESULTS 

Identification of the region of Al involved in binding A6 

A series of Al truncation mutants was constructed and 
used to identify the region of Al involved in binding to 
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A6, we co-expressed both protein domains in prey 
(without any tag) and bait (His 6 -tag) position, respect- 
ively, resulting in a stable interaction in both cases 
(Figure IB). The presence of His 6 -tagged A6 and Al OB 
was also verified by TEV protease cleavage and anti-Al 
immunopurification, respectively (Figure IB, lane 5; 
Supplementary Figure S3). These findings indicate that 
Al is the domain from Al responsible for the inter- 
action with A6. 
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A6. Al truncation mutants were designed to avoid disrup- 
tions in predicted domains (27). Using pull-down assays, 
interactions of 10 different truncated Al variants 
with His 6 -tagged wild-type A6 were tested. Al with all resi- 
dues prior to amino acid 626 removed, i.e. Al OB , is still 
capable of binding to A6 (Figure 1A and Supplementary 
Figure SI). In contrast, any Al truncation without 
Al OB failed to interact with A6 suggesting that the 
OB-fold of Al is the only A6-interacting domain of Al 
(Supplementary Figure S2). In order to obtain direct 

evidence of interaction between the Al OB and wild-type Three crystal structures of Al OBA in complex 

with A1 NblO 

In order to obtain the three-dimensional structure of 
Al OB , we used several approaches in parallel such as 
replacing the non-conserved loop L23 by a 4-residue 
linker (Supplementary Figure SI). However, neither 
Al OB nor Al OBA crystals could be obtained. Also 
complexes of Al OB or Al OBA with A6 failed to give 
suitable crystals. Given these difficulties, anti-Al 
nanobodies were generated and used as crystallization 
chaperones, a strategy which had previously been success- 
fully used in crystallizing challenging proteins by our 
collaborating laboratories (16,29,48-50). An Al OBA 
•A6» A1 NblO ternary complex could be obtained 
(Supplementary Figure S3) indicating that A1 NblO does 
not block A6 binding by Al OBA . Crystallization of the 
Al OBA »A6 in complex with A1 NblO yielded three crystal 
forms (I, II and III). Surprisingly, in all three crystal 
forms, the asymmetric unit contained two copies of 
Al OBA and two of A1 NblO indicating that A6 dissociated 
during crystallization. The three structures were refined at 
2.0, 2.7 and 2.65 A resolution, with R tree values of 22.9, 
26.4 and 26.7%, respectively (Supplementary Table SI). 
The structures of the six Al BA subunits in the three 
crystal forms superpose mutually with root-mean-square 
deviations 0.6 ~ 0.8 A (Supplementary Figure S4). In the 
following, we therefore describe the structure of crystal 
Form I, since this is the structure with the highest reso- 
lution (Figure 2). 

Although Al OB has a low level of sequence similarity 
with the previously determined crystal structures of 
editosome OB domains of A6 and A3 OB (16,29), the 
overall structure of Al OBA shares the conserved OB-fold 
architecture (Figure 2 and Supplementary Figure S5). 
Superimposing the Al OBA monomer onto A6 (PDB-ID: 
3K7U) results in a root-mean-square deviation of 1.5 A 
for 80 equivalent Ca atoms with 29% amino acid 
sequence identity. The superposition shows the conserved 
core p-barrel composed of six p strands (Supplementary 
Figure S5). A significant difference is the absence of helix 
al in Al BA (Figure 2A and Supplementary Figure S5). 
No electron density was observed for the putative helix al 
linking P3 and P4 in any of the six Al OB subunits in our 
three crystal structures, suggesting that residues which 
form usually this helix in other OB folds, are flexible in 
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Figure 1. T. hrucei Al and A6 interact with each other. (A) The 
T2-binding domain (T2BD) and OB-fold domain arrangement in Al. 
A schematic representation of full length Al and full length A6 with 
their interaction domains is shown with the T2BD of Al in blue and 
the OB folds of Al and A6 in gold. Direct interactions, as identified by 
bacterial co-expression and purification, are shown as a red polygon. 
The T2-binding domain (T2BD) of Al had been identified before (17). 
(B) Co-transformation and expression of Al B and A6 constructs. 
T. brucei A6 containing a His 6 -tag at the N-terminus co-transformed 
into Escherichia coli cells with Al OB (residues 626-762) was 
co-expressed. Cells were lysed and the His 6 -tagged A6»Al OB complex 
was captured by incubating the lysate with Ni-NTA beads (lane 2-4). 
The His 6 -tag peptide was cleaved off by TEV. This loss of the His 6 -tag 
peptide is evident in the reduced size of the now untagged A6 (lane 5). 
Abbreviations used in this and in the following figures: Cell, total lysate 
from induced cells; SF, soluble fraction; E, Ni-NTA elution fractions; 
E + TEV, eluate from first Ni-NTA after treatment with TEV protease; 
TEV, tobacco etch virus (TEV) protease. The positions and sizes (kDa) 
of marker polypeptides are indicated on the left. 



A1 OBA 

Al OBA dimerization involves the side-by-side 
arrangement of P strands from the two subunits result- 
ing in a contiguous six-stranded P-sheet with in the 
center an anti-parallel pair of N-terminal pi strands 
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Figure 2. The crystal structure of T. brucei Al OBA in complex with A1 NblO. (A) Overall structure of the Al OBA » A1 NblO complex. The structure is 
shown as a ribbon diagram with the Al OBA domain and A1 NblO depicted in green and gold, respectively. Selected secondary structure elements are 
labeled. The approximate location of the p-surface is indicated with a solid line. Two disordered regions are shown with dashed lines. 
(B) Electrostatic surface representations for Al OBA . The Al OBA domain is shown with its electrostatic surface charge and the nanobody A1 NblO 
as a ribbon diagram. The electrostatic potential surface of Al BA was calculated using APBS (51). Regions with potentials above +5 k b Te c ~' and 
below ~5 kbTe c _ are shown in blue and red, respectively. The locations of several conserved basic residues are indicated. (C) A side view of the 
complex shows that the basic residues on the surface of the Al BA are facing away from nanobody binding site. 



(Supplementary Figure S6). The three Al homodimers 
in the three new crystal structures form different dimers 
(Supplementary Figure S6), with the dimers of crystal 
Forms I and II being quite similar to each other but dis- 
tinctly different from the dimer in crystal Form III. The 
latter is closer in structure to that of canonical OB-fold 
dimers, while there is a gap between the neighboring (31 
strands from the two monomers in crystal Forms I and II. 
When superimposing one subunit of crystal Form I or II 
onto a subunit of Form III, it appears that the second 
subunit in Form I or II is rotated by 8.6° and shifted by 
2 A compared to the second subunits in Form III 
(Supplementary Figure S6B). Size exclusion chromato- 
graphy studies reveal an equilibrium of Al OB monomers 
and dimers in solution (Supplementary Figure S7) which, 
combined with the different dimers in our crystal struc- 
tures, suggests that there might be a mixture of monomers 
and multiple types of dimers in solution. 

The charge distribution on the surface of Al OBA reveals 
a distinctive feature on the surface of Al OBA : a large posi- 
tively charged region with contributing charged side 
chains mainly from strands P3, (34, |34' and (35' (Figure 2 
and Supplementary Figure SI). Interestingly, seven 
conserved basic residues are responsible for this highly 
positively charged surface. An alignment of the primary 
structures of Al OB domains from various species 
reveals that the R703, K715, R731, R734 and R742 side 
chains are identical in all of the Al proteins from these 
species (Supplementary Figure SI). Two additional 
residues, K719 and K741 of T. brucei are either a Lys 
or and Arg in all trypanosomes species. This conserved 
positively charged region could play a critical role in 
the recognition of negatively charged dsRNA substrates 
or other editosome proteins, as will be further investigated 
below. 

Identification of interacting domains of Al and L2 

To identify which domains of Al and L2 interact with 
each other, truncated forms of Al and L2 were con- 
structed and analyzed in pull-down assays after 



co-expression as described above. For this purpose, we 
generated expression systems for only the NTase 
domain of L2, comprising residues 21-285, and various 
N-terminal truncations of L2 with the NTase domain 
removed. We also constructed various non-tagged 
domain variants of Al. Co-expression and Ni-NTA pull- 
down experiments of untagged Al variants with His 6 - 
tagged L2 variants showed interesting results 
(Figure 3B). The C-terminal region of L2, spanning 
residues 308^116, was capable of interacting with the 
N-terminal region (residues 196-331) of Al (Figure 3B). 
No interactions between Al (residues 335-762) and full 
length L2 (see Supplementary Figure S8), and neither 
between full length Al and the L2 NTase domain were 
observed (data not shown). These results suggest that 
amino acids 196-331 of Al and amino acids 308^116 of 
L2 are functionally important for the interaction of Al 
and L2. Hence, residues 308^116 of L2 are hereafter 
called the A 1 -binding domain or L2 A1BD , and residues 
196-331 of Al the L2-binding domain or A1 L2BD 
(Figure 3A). 

Al promotes dsRNA binding by the RNA-editing 
ligase L2 

Based on Al's association with L2, we hypothesized that 
domains from both Al and L2 might be required to bind 
a nicked gRNA:mRNA duplex in the RNA ligation 
process. To test this hypothesis, a native gel mobility 
shift assay was used to examine the binding the nicked 
dsRNA substrates to length variants of L2 and Al. 
Binding reactions were performed in the absence of ATP 
so as to preclude conversion of substrate to product 
during the incubation. 

To determine the role of different parts of L2 in nicked 
dsRNA recognition, two length variants of L2 were made 
and co-expressed with full length Al. The first L2 variant 
retains residues 21-416 (i.e. comprises the NTase and 
A1BD domains) of L2, while the second contained only 
the A1BD of L2. Full length L2 in complex with full 
length Al readily binds dsRNA, but the L2 A1BD alone 
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Figure 3. Identification of interaction domains of T. brucei Al and L2. 
(A) A1 L2BD and L2 A1BD interact with each other. A schematic repre- 
sentation of the domains of full length Al and full length L2, i.e. the 
L2-binding domain (L2BD), the T2-binding domain (T2BD). and the 
OB-fold domain (OB-Fold) of Al, and the nucleotidyl-transferase 
domain (NTase) and the Al-binding domain (AlBD) of L2. The 
direct interaction between the L2BD and the AlBD, as identified by 
bacterial co-expression and purification (see Figure 3B), is shown as a 

and 

(residues 196-331) and L2 A1BD 
(residues 308^416) were co-expressed in E. coli and co-purified by 
Ni-NTA chromatography via the His 6 -tag of L2 (lanes 2^1). The 
His 6 -tag of L2 was removed with TEV protease. This loss of the 
His 6 -tag of L2 is indicated by an arrow (lane 5). The positions and 
sizes (kDa) of marker polypeptides are indicated on the left (lane 1). 
See Figure IB for abbreviations. 



in complex with full length Al does not interact with 
dsRNA (Figure 4A). Since the L2 NTase domain alone 
does also not interact with dsRNA (Figure 4B), this indi- 
cates that both the NTase domain and parts of Al are 
essential for RNA binding by the L2»A1 complex. 

To determine the roles of different parts of Al in inter- 
acting with L2 and RNA, three different domain variants 
of Al were made, A l( L2BD+T2BD+OB >, A 1< L2BD+T2BD > and 
A1 (L2BD) . Subsequently, these domain variants were 
co-expressed with full length L2, and tested for dsRNA 
binding (Figure 4C). Al (L2BD+T2BD+OB) in complex with 
full length L2 did bind to dsRNA, but full length L2 in 
complex with A1 (L2BD+T2BD) without the C-termmal 
OB-fold domain did not. In addition, A1 L2BD in 
complex with full length L2 lost overall RNA-binding cap- 
ability. These results suggest that the OB-fold domain of 
Al is essential for dsRNA binding by the AUL2 complex. 



Interestingly, the Al B domain by itself failed to bind 
tightly to nicked dsRNA (Figure 4D). As expected, the 
L2 A1BD , A1 L2BD and Al B domains individually did 
also not display any RNA-binding capability (data not 
shown). These observations suggest that the Al OB 
domain and the L2 NTase domain bind nicked dsRNA in 
a cooperative manner. 

Discovering critical residues of Al OB in dsRNA binding 
by the complex of the RNA ligase L2 and Al 

We next asked which area of Al OB assists L2 in dsRNA 
binding. We first tested the possible function in RNA 
binding of the flexible L23 loop of Al OB which was 
deleted in our constructs in order to obtain diffraction 
quality crystals (Figure 5A and Supplementary Figure 
SI). Interestingly, the removal of the 38 amino acid 
residues of L23 from Al did not affect its ability to bind 
dsRNA (Figure 5B). 

In the structure of Al OBA in complex with A1 NblO, 
~800 A 2 solvent accessible area of Al OBA (13% of its 
total accessible area) is buried by the A1 NblO nanobody 
(Figure 5A). It was of interest to determine whether Al° 
residues involved in nanobody binding had any RNA rec- 
ognition function in vitro. Figure 5C shows that the 
AUL2» A1 NblO complex was able to bind dsRNA, i.e. 
the nanobody did not prevent Al»L2's ability to bind 
dsRNA, and hence the surface of Al OB occupied by 
A1 NblO is not playing a significant role in RNA binding 
by the A1»L2 heterodimer. 

We next examined the role of the strictly conserved, 
solvent exposed basic residues found in a closely spaced 
cluster of positively charged residues, referred to as 'the 
arginine cluster' (Figures 5A and 6). To probe whether 
any of the conserved side chains in this region are func- 
tionally relevant, each of the seven basic residues R703, 
K715, K719, R731, R734, K741 and R742 of Al OB were 
replaced by glutamate, thereby creating the potential for 
electrostatic repulsion between the OB-fold domain and 
the RNA phosphodiester backbone. Figure 5D shows 
that three Al mutants, R703E, R731E and R734E com- 
pletely lost their RNA-binding property, while no effect 
on RNA binding by other single or double mutants was 
observed. These results indicate that the cluster of three 
arginines formed by R703, R731 and R734 of Al OB plays 
a crucial role in the RNA-binding process by the A1«L2 
complex. 

To investigate whether the A1»L2 complex had any 
preference for dsRNA over dsDNA substrates, we 
compared the binding of the A1»L2 enzyme complex 
with both nicked RNA duplex substrates and nicked 
DNA duplex substrates. The A1»L2 complex has a high 
affinity for various dsRNA substrates. However, A1»L2 
complex does not have a similar high affinity for dsDNA 
(Figure 5E). 



DISCUSSION 

The studies described in this paper present major progress 
in understanding properties of domains from two key 
editosome proteins: the largest interaction protein 
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Figure 4. The C-terminal domain of T. brucei Al promotes dsRNA binding by the RNA-editing ligase L2. Electrophoretic mobility shift assays were 
used to examine the binding of A1»L2 to nicked dsRNA substrate. Wild type and variants of A1»L2 binary complexes were co-expressed 
and co-purified as explained in Supplementary Methods. A total of 30 uM double-stranded 26-mer nicked RNA (see the sequence in 
Supplementary Table S2) was incubated with 15uM proteins or protein complexes for 30min at 4°C in 20 mM Tris (pH 7.5), 2mM DTT and 
250 mM NaCl. The RNA and A1»L2 complex were resolved on a 4-15% polyacrylamide gel and stained by ethidium bromide and Coomassie Blue, 
shown, respectively, at the left and the right of A-D. Each protein and the A1»L2 constructs used for the RNA-binding assays are illustrated in a 
schematic diagram above the gels. The positions of the RNA»A1»L2 complex and of RNA alone are indicated on the left. (A) The role of different 
domains of L2 in dsRNA recognition. The RNA binding abilities of (i) full length Al (lanes 1-3) by itself, (ii) L2 A1BD in complex with full length Al 
(lanes 4-6) and (iii) full length L2 <NTase+A1BD) with full length Al (lanes 7-9), show that the NTase domain of L2 is critical for nicked dsRNA 
recognition. (B) The role of the L2 NTilse domain in dsRNA recognition. The RNA binding abilities of L2 NTase domain by itself (lanes 2-3) and of 
wild-type full length Al in complex with full length L2 (lanes 4-5), show that L2 NT!,se alone fails to bind tightly to nicked dsRNA. Lane 1 contains 
26-mer nicked dsRNA alone as a control. (C) The role of different domains of Al in dsRNA recognition. The RNA binding abilities of (i) A1 L2BD 
with full length L2 (lanes 1-2), (ii) A1 (L2BD+T2BD) with full length L2 (lanes 3^t) and (iii) Al (L2BD+T2BD+OB) with full length L2 (lanes 5-6), show that 
the Al OB domain promotes RNA binding by L2. (D) The Al OB domain alone has little affinity for dsRNA. The RNA binding abilities of 15 and 
45 uM Al OB alone (lanes 2-3) were tested. Lane 1 contains 26-mer nicked dsRNA alone as a control. 



Al and the RNA-editing ligase L2. In particular, the 
multiple functions of the OB-fold of Al are most 
intriguing including the role it plays in the action of 
the RNA-editing ligase L2. Our new findings can be 



well-incorporated into the recently proposed model of 
an OB-fold center in the core of the editosome (29). It is 
remarkable that nanobodies were again required to obtain 
well-diffracting crystals illustrating once again the power 
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Figure 5. Identification of specific arginine residues involved in dsRNA binding by the T. brucei Al domain in complex with the RNA-editing 
ligase L2. (A) Surface presentation of the Al OBA structure. Left: the structure of Al OBA , with A1 NblO omitted for clarity, is shown with (i) the 
residues involved in binding of A1 NblO in red, (ii) the completely conserved and solvent exposed basic residues R703, R731 and R734 in blue and (iii) 
the remainder of the Al° A surface in green. The approximate locations of the two disordered regions are sketched with dotted lines. Right: the 
Al OBA structure rotated ~90° with respect to the left panel illustrating the A1 NblO-binding surface of Al OB which is not engaged in RNA-binding 
(Figure 5C). (B-E) Wild-type and variants of A1»L2 binary complexes were co-expressed and co-purified. Electrophoretic mobility shift assays — as 
explained in the Legend to Figure 4 — were used to evaluate the RNA-binding capability of the set of A1»L2 variants obtained. (B) The effect of the 
deletion of L23 on RNA affinity of Al . The RNA binding abilities of wild-type A1»L2 (lanes 2 and 3) and A1 A »L2 (lanes 4 and 5) are compared. 
Lane 1 contains 26-mer nicked dsRNA alone as a control. (C) The effect of A1 NblO binding on RNA-binding activity. The RNA binding abilities of 
A1 NblO alone (lanes 2 and 3), A1»L2 (lanes 4 and 5), and A1»L2» A1 NblO (lanes 6 and 7) were compared. Lanes 1 contains 26-mer nicked dsRNA 
alone as a control. (D) Mutational effects on RNA-binding activity. The RNA-binding assay was carried out with single or double mutants of Al 
with wild-type L2. Each of seven basic residues (R703, K715, K719, R731, R734, K741 and R742) in full length Al was replaced by glutamate as 
indicated (lanes 3-8). The RNA binding abilities of wild-type and mutant A1»L2 were tested. 26-mer nicked dsRNA alone (lane 1) and wild-type 
A1»L2 (lane 2) were used as a control. (E) The preference for dsRNA over dsDNA substrates on RNA-binding activity. The binding of both nicked 
RNA duplex substrates and nicked DNA duplex substrates by recombinant A1»L2 was tested. To probe the possibility of sequence specificity on 
dsDNA recognition, two different random sequence dsDNA were also tested as a control and appeared not to bind to A1»L2 (data not shown). 
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Figure 6. Model of the OB-fold center and associated proteins in the 
core of the editosome. (A) A 'shifted heterotetramer' model of the OB 
folds of Al, A2 or A4, A3 and A6. The shifted heterotetramer 
described in (29) was modified by replacing the model structure for 
Al B by the actual structure of Al B described in this article. The 
heterotetramer depicted contains the actual structure of the A3 OB »A6 
heterodimer (29), the new structure of Al OB (Figure 5 A and 
Supplementary Figure S4) and a model of A2 OB based on the structure 
of Al OB . The A2 OB domain might be an A4 OB domain instead, as 
discussed in (29). (B) The 'shifted heterotetramer' model with four 
associated enzymes depicted schematically. The OB domains are 
colored as in (A); the editing RNA ligase L2 is shown in blue; four 
additional editosome proteins in gray. The three domains of Al are 
schematically shown interacting with partner proteins in the core: the 
A1 L2BD with the L2 A1BD , A\ TitlD domain with T2 and Al OB with A6. 
The editosome proteins schematically depicted are: TUTase 2 (T2), 
Exonuclease 2 (X2), Ligase 1 (LI), Ligase 2 (L2), interaction proteins 
Al to A6 (A1-A6) and KREPB5 (B5). Interaction data is from refer- 
ences (15,17) and from Figures 1, 3 and 4. (C). Al and L2 recognizing 
dsRNA. Color code as in (B) above with dsRNA schematically added 
in gold, and with the L2 NTase domain closer to Al OB . The A1 L2BD and 
L2 1BD domains interact with each other, thereby enabling the Al OB 
and L2 NTase domains to cooperatively bind dsRNA substrate. For 
further explanation see 'Discussion' section. 



of these crystallization chaperones which had been suc- 
cessful in many other studies in our laboratories 
(16,29,48-50). 

Functions of editosome protein domains unraveled 

By expressing, purifying and analyzing a large number of 
variants of the interaction protein Al and the RNA- 
editing ligase L2, and complexes of the two, it has 
become clear that: 

(i) The OB fold of Al is engaged in an interaction with 
A6 (Figure 1). This result is consistent with previous 
results where it was shown that A6 forms a com- 
plex with both Al (L2BD+T2BD+OB) and A l (T2BD+OB) 
(17). The A6»Al OB interaction is probably not very 
strong since, in three different crystal forms 
obtained under mild conditions, the A6 chain is 
absent and only Al OBA in complex with nanobody 
A1 NblO is observed, while the crystallization ex- 
periments started with a solution containing a 
ternary A6«Al OBA « A1 NblO complex (Supplemen- 
tary Figure S3). 

(ii) The region surrounding the putative first 
zinc-binding site of Al, spanning residues 196-331, 
is responsible for interacting with the RNA Ligase 
L2, and therefore called A1 X2BD (Figure 3). 

(hi) the C-terminal domain of L2, and comprising resi- 
dues 308^116, is involved in associating L2 with Al, 
and is therefore called L2 A1BD (Figure 3). The inter- 
actions of the A1 L2BD and the L2 A1BD are key to 
making it possible for Al OB to assist the catalytic 
domain of L2 (L2 NTase ) in performing its function 
discussed as follows. 

Since it has been shown (17) that the region surrounding 
the second putative zinc-binding site of Al, comprising 
residues 396^182, is involved in binding to the enzyme 
T2 and is called the A1 T2BD , there are now three well- 
defined regions of the large Al protein responsible for 
contacts with three different editosome proteins 
(Figure 1A). The name 'interaction protein' for Al 
appears therefore to be very well chosen. It is also of 
interest to consider the global similarities in the domain 
structures of Al and A2 (27), and the fact that full length 
A2 has been shown to bind to the editing RNA Ligase LI 
and the editosomal exonuclease X2 (17,20). Given the 
similarities in domain structure of the two large inter- 
action proteins Al and A2, combined with the fact that 
the C-terminal domain of A2 (A2 OB ) interacts with A6 
[Figures 2A, 4 A and B in (17)], it is tempting to suggest 
that the region in A2 corresponding to the L2BD of Al is 
engaged in interactions with LI, and that the region in 
Al corresponding to the T2BD of A2 is engaged in 
binding to X2. 

In other words, the order of domains with known func- 
tions in Al is L2BD-T2BD-OB and in A2 the order is 
presumably L 1 BD X2BD OB, where the X2-binding 
function of the putative X2BD of A2 still has to be 
demonstrated but the role of the L1BD of A2 and the 
interactions of the OB fold of A2 with A6 have been 
shown before (17). Given the similarities of Al and 
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A2, and of LI and L2, it is likely that the C-terminal 
domain of LI is responsible for the interactions of 
LI with A2, specifically through interactions with the 
A2 L1BD surrounding the first putative zinc-binding 
region of A2 (17,20). 

Three crystal structures of the OB fold of 
the interaction protein Al 

The structure of the Al OBA monomer appears to have, in 
all three crystal forms we studied, a number of special 
characteristics. The residues expected to form the canon- 
ical helix al between strands (53 and P4 is not represented 
by well-defined electron density in any of the six independ- 
ent copies of Al OBA in the three crystal structures 
determined. This helix has been observed to be well 
defined in over 10 other OB-fold structures reported in 
the literature (Supplementary Figure S5), and its flexibil- 
ity in Al OBA is hence highly unusual, but might be partly 
due to a conserved proline residue in this region 
(Supplementary Figure SI). The significance of this flexi- 
bility is unknown at this moment but it might be that this 
region only becomes better defined in structure when 
Al interacts with other, yet to be discovered, proteins 
in the editosome. 

While the structures of the six Al OBA monomers are 
very similar, the three dimers in our structures are differ- 
ent which is highly unusual among OB-fold dimers. The 
dimers in crystal Form I and II are deviating substantially 
from the dimer in crystal Form III, with a 2 A shift per- 
pendicular to the interacting (31 and (51' strands, and a 
9° difference in orientation of the two subunits in the 
dimers (Supplementary Figure S6), and a concomitant 
loss of several pi— pi' main chain hydrogen bonds. 
However, the interface between OB folds in a dimer is 
extensive (29), hence the Al OBA dimers are stabilized by 
the interactions in other regions, e.g. by interfaces 
involving strands P3, P4 and P5. In all three dimers, 
Al OBA interacts with A1 NblO in essentially the same 
manner, hence nanobody binding is not responsible for 
the differences in Al OBA dimer organization in the three 
crystal Forms. 

Solution studies provide evidence for a monomer-dimer 
equilibrium of the Al OB domain (Supplementary Figure 
S7), which, together with the variability of the dimer struc- 
tures observed in the three crystal structures, suggests that 
the Al OB dimer can easily be disrupted. This would be in 
agreement with its participation in the OB-fold center of 
the editosome where only one Al OB domain is expected to 
be present since, as a very large interaction protein, Al is 
most likely only represented by a single copy in the 
editosome (29,52,53). 

The OB fold of Al assists RNA binding by RNA-editing 
ligase L2 in trans 

With the increased knowledge of the functions of domains 
in the A1»L2 complex, and the new structures of Al OBA , 
we carried out several further studies of this complex to 
unravel features essential for RNA binding. It appeared 
that neither the Al OB nor the L2 NTase domains by them- 
selves bound nicked dsRNA, and nor did the complex of 



full length L2 with a variant of Al with the Al removed 
(Figure 4). Hence, Al OB is essential for dsRNA binding by 
the A1»L2 complex. In DNA ligases and RNA capping 
enzymes known so far, an OB fold from these enzymes is 
involved in nucleic acid binding in cis (35,37,39,42). It has 
also been suggested that the C-terminal non-OB-fold 
domain of bacteria phage RNA ligases is required for 
nick recognition and ligation (54,55). This C-terminal 
domain of the phage RNA ligase then also promotes 
RNA binding in cis. In contrast, the OB fold of Al is 
able to assist RNA binding by L2, due to interactions 
between the L2 A1BD and the A1 L2BD which bring Al 
and L2 together. Flexible regions between domains of 
both proteins may bring the catalytic domain of L2 and 
the OB fold of Al in close proximity to enable RNA 
binding by the A1»L2 complex. To the best of our know- 
ledge, this is the first time that an OB fold has observed to 
increase the affinity for RNA by a member of the 
nucleotidyl-transferase family via an in trans mechanism. 

Additional experiments provided important informa- 
tion which part of Al OB is crucial for its function in 
RNA binding by the A1»L2 complex. For example, the 
ternary A1»L2» 'NblO complex did interact with nicked 
dsRNA with a similar affinity as the binary A1»L2 
complex and, hence, the surface area of Al OB used for 
AI NblO binding is not critical for the interaction of 
A1»L2 with RNA (Figure 5C). It also appeared that 
AUL2 and A1 A »L2 exhibited a similar affinity for 
nicked dsRNA binding and therefore loop L23 is not a 
key element of Al for engaging RNA (Figure 5B). Next, 
the possible role of a highly charged region on the Al OB 
surface was probed by mutating each of seven Arg 
residues into glutamates. Substituting a glutamate for 
three of these seven arginines caused loss of nicked 
dsRNA binding by the A1»L2 complex (Figure 5D). 
This established that a highly positively charged surface 
region is essential for RNA binding by Al° . It is very 
likely that also in the assembled editosome the A1 L2B 
plays a critical role to by bringing together the editing 
RNA ligase L2 and the interaction protein Al, and that 
the same positively charged residues of Al OB enhance 
RNA binding by the ligase L2 in trans. 

The OB center in the core of the editosome 

It is of interest to combine A6 and RNA-binding functions 
of Al B with the proposed 'shifted heterotetramer' of the 
OB-fold center in the core of the editosome (29). When 
taking the model of the shifted heterotetramer proposed 
by these authors, and replacing the modeled Al B fold in 
that tetramer by the Al OB structure determined in this 
study, we obtain a heterotetramer based on the experimen- 
tally observed A3 OB »A6 heterodimer and the new struc- 
ture of Al OB (Figure 6A). In this heterotetramer, where 
the fourth OB fold can either be that of A2 or of A4 [for 
discussion, see (29)], the following features of Al OB with 
respect to the other three OB folds are of interest as 
follows. 

(i) The flexible region between P3 and P4, which is 
usually helical in OB folds as mentioned above, is 
not in contact with any of the three other OB folds 
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in the heterotetramer. This highly conserved 
(Supplementary Figure SI) region could be of im- 
portance in the assembled editosome to interact 
with other editosome proteins. 

(ii) The region most likely occupied by loop L23, which 
was deleted in our studies in order to obtain diffrac- 
tion quality crystals, is also accessible to solvent in 
the shifted heterotetramer (Figure 6A). Loop L23 is 
not conserved in trypanosomatids (Supplementary 
Figure SI) and hence its role in editosome 
function, if any, will require further study. 

(iii) The highly positively charged region including the 
three arginines R703, R731 and R734, demonstrated 
to be involved in RNA binding (Figure 5D), is fully 
accessible to solvent. 

The highly positively charged region on Al OB is very 
likely interacting with nicked dsRNA in the A1«L2 binary 
complex (Figure 5E), and most probably the same will 
happen, at least transiently, in the core of the editosome 
during the RNA-editing process. Consistent with this 
model with its exposed positively charged surface region 
of Al OB , the A1»A6»L2 ternary complex did bind dsRNA 
(data not shown). 

To illustrate, more clearly, the possible role of the OB 
fold center, additional domains of Al, A2 and A3 have 
been added schematically to the shifted heterotetramer of 
OB folds (Figure 6B). In the absence of RNA, the Al OB 
and the L2 N ase domains might be at a distance from each 
other, but the presence of nicked dsRNA may induce at a 
certain moment a conformational change in L2, and 
possibly also in the linkers between the different 
domains of Al, and bring the Al OB and the L2 NTase 
closer together to enhance RNA binding and to enable 
L2 to carry out the ligation reaction (Figure 6B and C). 
This results in a sealed mRNA strand which is subject to 
subsequent editing steps, except of course when this is the 
final editing reaction for this mRNA-editing site. 

Obviously, major questions still remain to be answered 
regarding the mechanism of action and the motions in the 
RNA-loaded editosome. Yet, our studies have provided 
new insights in the mechanism of RNA editing by the 
editosome, in particular by identifying (i) the domains of 
Al and L2 which interact with each other and (ii) a crit- 
ically important region for RNA binding on the surface of 
the C-terminal OB fold of Al, which acts in trans to 
enhance RNA binding by the editing ligase L2. 
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